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Description 



Methodjbr the characterisation of nucleic acid molecules 

Technical Field 

This invention relates to a method for the rapid and sensitive 
5 characterisation of nucleic acids, especially DNA, including the 

identification and detection of genetic variations in a target nucleic acid. 

Background Art 

Detection of specific sequences at given loci in nucleic acids is 
central to many diagnostics procedures, notably in the diagnosis of 
10 inherited disorders and of infectious diseases. On-going requirements 
for such diagnostics procedures include optimising high sample 
throughput and sensitivity and specificity. 

WO 97/03210 describes a method for rapidly detecting the 
presence or absence of a particular nucleic acid sequence at a candidate 

15 locus in a target nucleic acid sample which comprises the steps of: i) 
introducing a modified base which is a substrate for a DNA glycosylase 
into the candidate locus at one or more preselected positions; ii) excising 
the modified base by means of said DNA glycosylase so as to generate 
an abasic site; iii) cleaving phosphate linkages at abasic sites generated 

20 in step ii); and iv) analysing the cleavage products of step iii) so as to 
identify in said target nucleic acid sequence the presence or absence of 
the particular nucleic acid sequence at the candidate locus. 
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WO 99/54501 describes a method for characterising nucleic acid 
molecules which involves generating extendible upstream DNA 
fragments which result from the cleavage of nucleic acid at an abasic site 
involving the use of DNA glycosylase. The extendible upstream 
5 fragment is incubated in the presence of an enzyme, for example a 
polymerase or a ligase, allowing for extension thereof and template 
nucleic acid followed by analysis of the resultant fragment(s). This 
method can be used to scan or check a fragment of DNA for the presence 
or absence of a mutation. 

10 Any method which reduces the time taken, while at the same time 

improving sensitivity, for characterising specific sequences would have 
significant advantages in the detection of mutations and also for 
genotyping known polymorphisms and mutations. 

Disclosure of Invention 

15 The invention provides a method for characterising nucleic acid 

molecules, which comprises the steps of: 

i) introducing a modified base which is a substrate for a DNA 
endonuclease into a target DNA molecule; and 

ii) reacting the nucleic acid containing the modified base with 
20 said DNA endonuclease such that the nucleic acid is 

cleaved to generate an upstream fragment containing said 
modified base and bearing a 3' hydroxyl group. 
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The nucleic acid can be characterised at this stage by analysing the 
cleavage products in a manner known per se. 

The invention has advantages over the prior art methods described 
above, including the fact that a secondary enzymatic or chemical 
5 cleavage step is not required, thereby allowing higher throughput and 
more cost-effective DNA characterisation. Thus, the method according 
to the invention involves endonuclease mediated cleavage rather than 
glycosylase mediated cleavage as in the case of the prior art methods 
described above. 

10 In the method according to the invention the modified base as 

such targets cleavage of the nucleic acid strand because it directs the 
endonuclease to cut the nucleic acid strand on which it resides at a site 
predominantly one nucleotide removed downstream therefrom, so that 
the modified base remains in the nucleic acid as the upstream fragment. 

15 Therefore, if an upstream fragment is generated that is 25 nucleotides 
long, then the modified base is at position 24 (counting 5 1 to 3 ') of said 
fragment. 

Furthermore, the method according to the invention enables one to 
detect polymorphisms in any sequence context and such detection is not 
20 dependent on surrounding nucleotide sequence. 

Preferably, the modified base is introduced by enzymatic 
amplification of the nucleic acid. 
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In this embodiment, the target nucleic acid is preferably amplified 
using normal DNA precursor nucleotides and at least one modified 
precursor nucleotide. 

The term "amplifying" as used herein refers to any in vifro process 
5 for increasing the number of copies of a nucleotide sequence or 

sequences. Amplification of a target sample results in the incorporation 
of precursor nucleotides into the DNA being amplified. 

Typically, amplification of a target sample is earned out using the 
polymerase chain reaction (PCR) using a combination of normal 
10 precursor nucleotides and modified precursor nucleotides. Normal 

precursor nucleotide(s) refer to the deoxyribonucleotides dATP, dCTP, 
dGTP and dTTP. 

Modified precursor nucleotide(s) as used herein refers to a 
modified nucleotide or nucleotides that can be incorporated into a 
15 nucleic acid so that a substrate base or bases (endonuclease substrate 

base) is generated which is recognised by a DNA endonuclease enzyme. 

According to one embodiment, a nucleotide containing the 
modified base partially replaces a normal precursor nucleotide. 

According to an alternative embodiment, a nucleotide containing 
20 the modified base totally replaces a normal precursor nucleotide. 

The amplification will typically involve amplifying a target 
nucleic acid using a combination of normal DNA precursor nucleotides 
and one or more modified precursor nucleotide(s) where a modified 
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precursor nucleotide replaces all or a proportion of one of the normal 
precursor nucleotides. 

Amplification of a nucleic acid using a modified precursor 
nucleotide to totally replace one of the normal precursor nucleotides 
5 results in the incorporation of an endonuclease substrate base into DNA 
in place of one of the four normal bases G, A,T,C. This permits cleavage 
of a primer extended in an amplification reaction at the first position 3 1 of 
the primer where a normal base is replaced by modified base. 

Following the amplification process, the amplified product is 
10 treated with a suitable DNA endonuclease enzyme which recognises the 
modified base and cleaves the DNA phosphodiester at or near the site of 
incorporation of said modified base. Prior to treatment with a suitable 
DNA endonuclease, double stranded DNA may be treated with 
exonuclease L This treatment serves to digest any unused primers and 
15 any non-specific single stranded DNA amplification products, thus 
improving the signal to noise ratio. However, this is not an absolute 
requirement for the method according to the invention. 

In the case where the modified precursor nucleotide is dITP, the 
modified base inosine will be generated at guanine positions in the 
20 amplified target nucleic acid sample. Addition of Endonuclease V to the 
sample cleaves the DNA predominantly at the second phosphodiester 
bond 3 f to the inosine, however the inosine residue is not removed from 
the DNA by the enzyme. 
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Although inosine has been termed and used as a universal base in 
that it can substitute for all four normal bases and base pairs with all four 
normal bases, it primarily acts as a substitute for guanine. Therefore in 
the method according to the invention, inosine primarily substitutes for 
5 guanine. 

In the case where the modified precursor nucleotide is dUTP, the 
modified base uracil will be generated at thymine positions in the 
amplified target nucleic acid sample. Addition of Endonuclease V to the 
sample also cleaves the DNA at the second phosphodiester bond 3' to the 
10 uracil base, however the uracil residue is not removed from the DNA by 
the enzyme, as discussed further below. 

According to a further alternative embodiment, the modified base 
is introduced by chemical modification of an existing base. 

The modified base can be introduced by chemical modification of 
15 a nucleic acid. Several methods exist where the treatment of DNA with 
specific chemicals modifies existing bases so that they are recognised by 
specific DNA endonuclease enzymes. For example, treatment of DNA 
with sodium bisulfite causes deamination of cytosine residues in DNA to 
form uracil residues in DNA, which target Endonuclease V cleavage of 
20 the DNA strands containing uracil residues. Treatment of DNA with 
oxidising agents, generates several modified bases including inosine, 
caused by deamination of adenine. Thus, bases present in the nucleic 
, acid can be converted into modified bases by chemical means. 



1 I 

* , WO 2004/003232 PCT/IE2003/000098 

7 

The modified base is preferably selected from inosine 
(hypoxanthine) and uracil. 

The preferred modified precursor nucleotides dITP and dUTP 
when incorporated into DNA generate the endonuclease substrate bases, 
5 inosine and uracil, respectively. 

The modified precursor nucleotide dUTP is a base sugar 
phosphate comprising the base uracil and a sugar phosphate moiety. 

The modified precursor nucleotide dITP is a base sugar phosphate 
comprising the base hypoxanthine and a sugar phosphate moiety. 

10 Amplification of a target nucleic acid using the precursor 

nucleotides dATP, dCTP, dGTP, dTTP and a specified amount of the 
modified precursor nucleotide dITP results in an amplified DNA where 
guanine is preferentially replaced randomly by hypoxanthine. The 
hypoxanthine is incoiporated in the newly synthesised DNA strand at 

15 positions complementary to cytosine residues in the template DNA 
strand during the amplification process when the other precursor 
nucleotides are not limiting. 

Amplification of a target nucleic acid using the precursor 
nucleotides dATP, dCTP, dGTP, dTTP and a specified amount of the 
20 modified precursor nucleotide dUTP results in an amplified DNA where 
thymine is replaced randomly by uracil. The uracil is incorporated in the 
newly synthesised DNA strand at positions complementary to adenine 
residues in the template DNA strand during the amplification process. 
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More than one type of modified nucleotide can be incorporated. 
For example, dUTP can replace dTTP and dITP can replace dGTP. 

In one embodiment of the invention, the endonuclease is a 
thermostable endonuclease. 

5 Thermostable endonucleases are available from several 

thermostable organisms including Endonuclease V - like activity from 
Thermatoga maritima and Archeaglobas fulgidus. 

A DNA endonuclease as used herein cleaves the DNA backbone 
at or near the site of the endonuclease substrate base but predominantly 
10 at the second phosphodiester bond 3' of the modified base. When the 
endonuclease cleaves the DNA backbone at or near all the sites of 
incorporation of the endonuclease substrate base many different 
fragments are generated. 

The method of cleavage is referred to herein as endonuclease- 
1 5 mediated cleavage. 

A specific example of an endonuclease as used herein is 
Endonuclease V from E. coli (also called Deoxyinosine 3'- 
Endonuclease) which cleaves at the second phosphodiester bond on the 
3 1 side of the endonuclease substrate base. This is distinct from some 
20 other endonucleases that cut at the site of the base and also distinct from 
glycosylases which excise glycosylase substrate bases. It should be noted 
that some endonuclease substrate bases can also be glycosylase substrate 
bases. For example, uracil and inosine are endonuclease substrate bases 
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which target cleavage of DNA by Enonuclease V. However, these bases 
are also substrates for Uracil DNA glycosylase and 3-methyl-adenine 
glycosylase, respectively. Endonuclease V cleaves 3' or downstream of 
the modified base, therefore an upstream fragment is generated that 
5 contains the modified base. 

Fragments of differing lengths are therefore produced if there are 
differences in the sequence of the DNA due to additional or reduced 
incorporation of an endonuclease substrate base. 

The method according to the invention is distinguishable from the 
10 methods described and claimed in WO 97/032 10 and WO 99/54501 in 
that in the case of the cleavage step of the present invention a single 
enzyme recognises and cleaves the DNA in a single step, in contrast with 
the requirement for two separate enzymes or an enzyme and a chemical 
agent in the case of the prior art methods. In addition, in the case of the 
15 present invention, the modified base remains in the upstream fragment, 
whereas, in the case of the prior art, the modified base is excised from 
the DNA. 

In one embodiment of the invention, the endonuclease cleavage 
reaction is carried out concurrently with a thermocycled amplification 
20 reaction using a thermostable endonuclease. 

In one embodiment of the invention, one of the initial primers, 
termed the diagnostic primer, is placed adjacent to a locus where a DNA 
sequence variation such as a polymorphism or a mutation occurs, so that 
the first modified base incorporated into the extended primer is at the 
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mutation locus, then theinitial primer will be extended during the 
amplification reaction and then cleaved to a specific length depending on 
whether or not a mutation is present at the mutation locus following 
amplification and endonuclease-mediated cleavage. The fragments 
5 generated following the initial extension and endonuclease-mediated 
cleavage, containing the modified base are referred to as upstream 
fragments. The primers prior to extension are referred to as the initial 
primers in this embodiment, one of which is termed the diagnostic 
primer. 

10 Thus, in an important aspect of the invention, at least one of the 

primers for the amplification step, i.e. the diagnostic primer, is 
positioned adjacent a locus where a DNA sequence variation occurs. 

In a further embodiment of the invention, the diagnostic primer is 
placed adjacent to a locus where a DNA sequence variation such as a 

15 polymorphism or a mutation occurs (upstream), so that the first modified 
base incorporated into the extended primer is after the polymorphic locus 
(downstream), then the initial primer will be extended and then cleaved 
to a similar length but will have a different molecular mass or 
hydrophobicity depending on whether or not a polymorphism is present 

20 at the polymorphic locus following amplification and endonuclease- 
mediated cleavage. 

This mass or hydrophobicity difference is readily determined by 
analysis on a mass spectrometer or HPLC, respectively. Alternatively, 
the sequence difference can be determined by differential hybridization 
25 of the upstream fragments. 
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In a still further embodiment of the invention, the method herein 
defined can be used to scan a stretch of nucleic acid for sequence 
variations. In this embodiment the initial primers used to amplify the 
stretch of nucleic acid, for example in PCR, are placed surrounding the 
5 area to be screened for genetic variations such as a polymorphism or a 
mutation so that the modified base is incorporated into the extended 
primer at all positions of the normal nucleotide that it is replacing. 
Endonuclease mediated cleavage of the amplified DNA will yield a 
population of fragments of various lengths dependent on the position of 
10 the replaced normal nucleotide in the original nucleic acid and also 
dependent on the sequence of the genetic variation. Therefore, the 
presence or absence of a sequence variation is determined by a gain or 
loss of one or more of the fragments or, indeed, a change in mass or 
hydrophobicity of one or more fragments. 

15 When a proportion of a normal precursor nucleotide is replaced 

with a modified precursor nucleotide in the amplification reaction, 
endonuclease mediated cleavage of the amplified DNA will yield a 
population of upstream fragments of various lengths, since different 
molecules will be cleaved at different points depending on where the 

20 modified precursor nucleotide was incorporated. The length of each 

fragment is determined by the position of incorporation of the modified 
precursor nucleotide during extension from the 3' end of the primer. 

When a modified precursor nucleotide replaces a proportion of 
one of the normal precursor nucleotides, preferably the ratio of the 
25 modified precursor nucleotide to the normal precursor nucleotide that it 
is replacing is such that an optimum of one modified precursor 
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nucleotide is incorporated per strand of amplified DNA. This allows 
subsequent cleavage of the amplified DNA strand into two fragments 
following action of an endonuclease that is specific for that modified 
base. A higher ratio of modified base to normal base is used to generate 
5 more than one cleavage site per amplified DNA strand. 

The presence of a sequence variation or mutation results in the 
appearance or disappearance of a cleavage fragment as judged by 
comparison with the known DNA sequence of the amplified molecule. 
Size and/or mass analysis of the fragments allows the precise location 

10 and sequence of the mutation in the target nucleic acid sample to be 
determined. Therefore, if a sequence variation occurs such that an 
additional site of modified precursor nucleotide incorporation is 
generated, an additional cleavage fragment will be observed upon 
analysis of the ladder of cleavage products. If a sequence variation 

15 occurs such that a site of modified precursor nucleotide incorporation is 
lost, the corresponding cleavage fragment will not be observed upon 
analysis of the ladder of cleavage products. 

If the sequence variation occurs such that it does not cause the 
gain or loss of a modified nucleotide incorporation site, the variation can 
20 be detected and determined by the change in mass of the upstream 

fragment that contains said variation. Alternatively, said variation can 
be detected by observing the altered electrophoretic mobility of said 
upstream fragment under non-denaturing conditions. 
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Direct detection of upstream fragments can be achieved by a 
variety of means. The initial primer used in the extension may be 
suitably labelled. 

Labelling of the primers can be performed by a variety of means 
5 including addition of a radioactive, fluorescent, or detectable ligand to 
the primer during or post primer synthesis. The use of a labelled 
precursor nucleotide or dideoxy terminator nucleotide in any of the 
extension reaction facilitates detection of the upstream fragments. 

Direct DNA staining methods such as silver or ethidium bromide 
10 staining facilitate detection of all products after size separation, for 
example after separation by electrophoresis. 

All DNA fragments can be detected, including upstream 
fragments, by incorporation of a detectable ligand during the initial 
amplification reaction, and/or during a subsequent extension reaction, if 
15 such is carried out. Such labelling can be achieved by using labelled 
nucleotides during the polymerisation steps, for example, using 
fluorescent labelled nucleotides in a manner known per se. 

The products/cleavage pattern resulting from the endonuclease 
mediated cleavage of the amplified target nucleic acid sample may be 
20 analysed by existing DNA sizing methods, such as polyacrylamide gel 
electrophoresis or high performance liquid chromatography (HPLC). 
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When the method according to the invention is used for 
genotyping a known polymorphism, the associated upstream fragment 
determines the presence of one or other of the alleles. 

When the method according to the invention is used for scanning 
5 for polymorphism, comparison of the analysis with the wild type DNA 
allows determination of the presence or absence of a sequence variation 
in the target nucleic acid sample. 

Labelling of the upper primer prior to the amplification process 
facilitates detection of the upstream fragments derived from the upper 

10 primer. On the other hand, labelling of the lower primer prior to the 
amplification process facilitates detection of the upstream fragments 
derived from the lower primer. The use of labelled precursor nucleotides 
in the initial amplification process and / or during a subsequent extension 
reaction, when such is carried out, facilitates detection of all the 

15 fragments generated as a result of the endonuclease mediated cleavage of 
the amplified target nucleic acid sample. 

An alternative method for detecting the cleaved DNA fragments, 
including upstream and downstream fragments, is by hybridisation, 
using a probe which is a labelled nucleic acid fragment. For example, the 

20 upstream fragments of the amplified upper strand can be detected by 
hybridisation of a suitably labelled oligonucleotide that is 
complementary in sequence to the upper primer. Similarly, the upstream 
fragments of the amplified lower strand can be detected by hybridisation 
of a suitably labelled oligonucleotide that is complementary in sequence 

25 to the lower primer. 
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It will be appreciated that using certain conventional detection 
methods, no additional label is required in order to detect the DNA. Such 
a detection method would be mass spectrometry. Using this method, all 
cleavage fragments are detected directly and separated based on their 
5 mass. This method has the added advantage in that, for example, 
upstream fragments of the same length can be identified and 
distinguished on the basis that they have a different mass due to their 
different base composition. 

Another such detection method is UV absorption. 

10 The 3' terminal sequence of the upstream fragment is 

enzymatically synthesised herein and is directly related to the nucleic 
acid being characterised since the target nucleic acid has acted as the 
template for its synthesis. Thus the 3' end of the upstream fragment is 
complementary to the target nucleic acid. Accordingly, determination of 

15 the nature of the 3' end of the upstream fragment by any means allows 
characterisation of the nucleic acid from which it was derived. 

Such upstream fragments are generated so that they can be used as 
primers for a subsequent extension reaction, if required. 

Accordingly, the upstream fragment generated in step ii) can be 
20 used as a primer for a subsequent extension reaction. 

The extension can be carried out using a DNA polymerase. 

Alternatively, the extension can be carried out using a DNA 

ligase. 
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Thus, the nature of the sequence of the 3' end of upstream 
fragments can be determined by their ability to function as primers in a 
. subsequent extension reaction using a template nucleic acid. Essentially, 
such determination is based on the ability of the 3' end of the upstream 
5 fragment to hybridise to a selected template under selected conditions. 
Following partial or complete hybridisation the upstream fragment may 
be extended using a nucleic acid polymerase and nucleic acid precursors 
or selected combinations of same. It will be appreciated that multiple 
possibilities exist for the selection of template molecules. Nonetheless, 
10 the extension of the upstream fragment is a measure of its ability or lack 
of ability to hybridise to a selected template molecule and thus the 
determination of the nature of the sequence of the 3' end of upstream 
fragment is on this basis. 

Because the 3' terminus of the upstream fragments differs 
1 5 depending on whether a mutation was present or absent at the mutation 
locus, be it the ultimate 3' terminal nucleotide or, indeed, the last one to 
five or so 3' terminal nucleotides, analysis of the ability of the upstream 
fragment to function in a subsequent extension reaction using a template 
nucleic acid permits the determination of whether a mutation was present 
20 or absent at the mutation locus. Any naturally occurring or enzymatically 
or chemically synthesised template which fully or partially hybridises to 
the upstream fragment can be designed and / or selected as a template 
nucleic acid allowing the ability of the upstream fragment to function as 
a primer to be determined. 

25 As indicated above, the nature of the sequence of the 3' end of 

upstream fragments can also be detennined using extension by DNA 
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ligase. In this aspect of the invention, the upstream fragment and a 
capture or reporter oligonucleotide are hybridised adjacent to one 
another on a template DNA strand. The ligase forms a phosphodiester 
bond between the upstream fragment and the reporter oligonucleotide, 

5 provided that the upstream fragment and the oligonucleotide junction 
have correctly hybridised with the template, i.e. no mismatches or gaps 
present at the junction. Thus, an exact match between the 
oligonucleotides and the template DNA at the junction permits the 
ligation of the upstream fragment to the reporter oligonucleotide forming 

10 a larger product which is the cumulative size of both the 

oligonucleotides. This reaction could also be cycled in a ligase chain 
reaction resulting in the amplification of the larger ligated product. Thus, 
detection of the larger product shows that the 3 f end of the upstream 
fragment hybridised to the template strand at the junction, while the 

15 absence of a larger product indicates that there were differences between 
the upstream fragment sequence and the sequence of the template DNA. 

As indicated above, the endonuclease cleavage reaction can also 
be earned out simultaneously with a thermocycled polymerization 
reaction which incorporates the modified base into the DNA. This can be 

20 achieved by including a thermostable EndonucleaseV-like enzyme in the 
thermocycled amplification reaction. For example, one can carry out the 
initial amplification of target nucleic acid using only normal precursor 
nucleotides dGTP, dATP, dTTP and dCTP. The product of this reaction 
is then treated with a phosphatase to degrade all nucleotide triphosphates 

25 and the phosphatase is then heat denatured. Fresh nucleotides are then 
added, where at least one normal nucleotide is partially or completely 
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replaced by a modified precursor nucleotide, in addition to thermostable 
polymerase, thermostable endonuclease and additional diagnostic primer 
and a thermocycled amplification reaction is effected. In effect, during 
each cycle, the diagnostic primer is extended using the initial 

5 amplification product as template and modified nucleotide is 

incorporated into the extended primer, thus generating a substrate for 
cleavage by the thermostable endonuclease, thereby generating 
upstream/ diagnostic fragments. The diagnostic primer can be used as 
one of the original amplification primers or can be an alternative or 

10 nested primer. If the diagnostic primer is different from the initial 

amplification primers then an Exonuclease I step can also be included 
with the phosphatase treatment, thereby degrading all unused initial 
primers so that the diagnostic primer is the only primer present during 
the cycling reaction. 

15 It will be appreciated that the present invention has significant 

advantages over the prior art, in particular, the prior art hereinabove 
referred to. 

For example, the method according to the invention can be viewed 
as a single step reaction requiring just a single enzyme in the sense that 
20 no secondary cleavage enzyme or step is required, which will allow 
higher throughput and more cost-effective DNA characterisation. 
Upstream DNA fragments are directly extendible and no extra 
processing is needed. 

One can use a single enzyme to detect all possible single base 
25 polymorphisms, for example, Endonuclease V from specific sources can 
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cut at U and I sites, therefore both T to A (and A to T) and G to C (and C 
to G) polymorphisms can be detected. 

When using an Endonuclease V-type of enzyme, there is no 
absolute need for Exonuclease I treatment of the amplified product prior 
5 to endonuclease mediated cleavage since any diagnostic fragment will be 
larger than the unused primer, even if primer must be designed adjacent 
to the polymorphic site, since Endonuclease V cuts two phosphodiester 
bonds downstream of the polymorphic site. 

The method according to the invention allows one to carry out 
10 both genotyping and the discoveiy of polymorphisms. 

As indicated above, the method according to the invention permits 
detection of polymorphisms in any sequence context and said detection 
is not dependent of sunounding nucleotide sequence. 

The method according to the invention is not dependent on the 
15 presence of mismatches or external probes for detection of 
polymorphisms . 

Brief Description of the Drawing 

The accompanying figure is a diagrammatic representation of the 
method in accordance with the invention. 
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Modes for Carrying Out the Invention 

Example 1 

The method according to the invention was used to detect the 
5 presence ofaG to C SNP (single nucleotide polymorphism) at position 
1357 of the human pyruvate dehydrogenase kinase, isoenzyme 2 gene 
(PDK2) (Accession number L4245 1) in a patient heterozygous for that 
SNP. The sequence of steps is depicted in the accompanying figure. The 
lower strand shown in the figure is selected as the target strand and the 
10 presence or absence of a G nucleotide at the SNP site was determined. 
The G allele refers to that PDK2 allele bearing the G nucleotide at 
position 1357 (on the upper strand) and the C allele refers to the PDK2 
allele bearing the C nucleotide at position 1357 (on the upper strand). 

The diagnostic primer (in this case it is the lower primer) was 
15 synthesised with a 5' fluorescent label (denoted by *). The genomic 
DNA sample was amplified by PCR as follows: the reaction mix for 
PCR contained lOng genomic DNA from the patient, 0.2mM dATP, 
dCTP, dITP and dTTP, 1.5mMMgCl 2> 1 OmM Tris-HCl, pH8.3, 50mM 
KC1, 0.3 units JumpStart Taq Polymerase, 6pmoles of each primer in a 
20 total volume of 5ul. The reaction was then placed in a thermocylcer with 
a heated hood. 35 Cycles of 95° C for 30 seconds, 45° C for 60 seconds 
and 65° C for 300 seconds were carried out in a thermocycler followed 
by 65° C. for 300 seconds. The reaction mixture bearing the amplified 
DNA was then treated with Exonuclease I to digest any unincorporated 
25 primers remaining in the reaction mixture as follows: 1 ul of the PCR 
reaction with 2 units of Exonuclease I in a 5ul reaction volume at 37° C 



WO 2004/003232 



21 



PCT/IE2003/000098 



for 30 minutes. The exonuclease was subsequently heat inactivated by 
incubating the reaction at 80° C for 15 minutes. 

The reaction volume was increased to 20ul with the addition of 
H 2 0 and was purified by centrifugation through a CentriSep (CentriSep 

5 is a Trade Mark) DNA spin column (Princeton Separations). The 

purified DNA (20p.l) was then digested with 1 ul of Endonuclease V (0.4 
ug/ml preparation) in the presence of 2mM MgCl 2 . The reaction was 
incubated at 65°C for 30 minutes. 2[xl of the reaction was added to lOul 
of formamide and 0.2ul of ROX labeled size standard, which was heated 

10 to 95° C for 5 minutes. 

The reaction was loaded onto an ABI 3100 Genetic Analyzer 
containing POP-4 (POP is a Trade Mark) matrix and electrophoresed at 
15kV for 30 minutes. Analysis of the run showed that two cleavage 
products were present in approximately equal amounts. One product was 
15 29 nucleotides (n) in length, corresponding to the G allele, while the 
second product corresponding to the C allele, was 23 n in size, as 
described below. 

The 23n product could only have been generated if an inosine 
residue was incorporated in the lower strand at position 1357. Thus, the 
20 detection of a 23n fragment is diagnostic of the presence of a C residue 
at position 1357 on the upper strand, hence incorporation of an inosine 
residue as the complementary base on the lower strand and the 
Endonuclease V enzyme cutting the lower strand one nucleotide 3' (i.e. 
downstream) of the inosine residue. In the G allele, the first inosine 
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incorporated on the lower strand should be opposite the C nucleotide at 
position 1350 on the upper strand and should result in a 29n product 
since Endonuclease V enzyme cuts the lower strand one nucleotide 3 1 
(i.e. downstream) of the inosine residue. Thus the detection of a 29n 

5 fragment is diagnostic of the absence of a C residue at position 1357 on 
the upper strand, thereby indicating the presence of the other allele, 
namely G allele at this position. The 23n and 29n products were detected 
in roughly equal amounts indicating that the target nucleic acid was from 
an individual heterozygous for the G1357C polymorphism. When a 

10 patient homozygous for the G allele was analysed, then only a 29n 

product was detected. When a patient homozygous for the C allele was 
analysed, then only the 23 n product was detected. The relative intensity 
of the 29n product to the 23n product allows one to determine the 
relative levels of the normal and mutant allele in a target nucleic acid 

15 sample. This is especially useful for analyzing pooled DNA samples 
where there may be a large difference between the levels of each allele. 

To diagnose a G to C transversion, or vice versa, the target strand 
can be either strand of the target locus. The decision on which strand the 
diagnostic primer is designed against can be made based upon optimal 
20 primer design considerations and on the most desirable product lengths. 
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Example 2 

The method according to the invention was used to detect the 
presence of a G to A SNP at position 380 of the Synaptopysin gene from 
5 Atlantic cod {Gadus morhua) in an individual heterozygous for that 
SNP. The sequence of steps is similar to that depicted in the 
accompanying figure. However, in this instance the upper strand is 
selected as the target strand and the presence or absence of a G 
nucleotide at the SNP site was determined. The G allele refers to that 
1 0 synaptophysin allele bearing the G nucleotide at position 380 and the A 
allele refers to the synaptophysin allele bearing the A nucleotide at 
position 380. 

The diagnostic primer (in this case it is the upper primer) was 
labelled with a y 32 PATP. The genomic DNA sample was amplified by 

1 5 PCR as follows: the reaction mix for PCR contained lOOng genomic 
DNA from the Atlantic cod, 0.2mM dATP, dCTP, dITP and dTTP, 
1.5mM MgCl 2 , Bioline (Bioline is a Trade Mark) Taq buffer, and 1 unit 
of Bioline Taq Polymerase, 50 ng of each primer in a total volume of 
25ul. The reaction was then placed in a thermocylcer with a heated hood. 

20 35 Cycles of 95° C for 60 seconds, 45° C for 60 seconds and 65° C for 
180 seconds were carried out in a thermocycler followed by 65° C. for 
600 seconds. The reaction mixture bearing the amplified DNA was then 
treated with Exonuclease I to digest any unincorporated primers 
remaining in the reaction mixture as follows: 5 pi of the PCR reaction 

25 with 2 units of Exonuclease I, 5mM MgCl 2 , 1 OmM Tris/HCl pH 7.5 in a 
19ul reaction volume at 37° C for 30 minutes. The exonuclease was 



WO 2004/0032J2 



24 



PCT/IE2003/000098 



subsequently heat inactivated by incubating the reaction at 80° C for 1 5 
minutes. 

The DNA mix (19ul) was then digested with 1 ul of Endonuclease 
V (0.4 p-g/ml preparation). The reaction was incubated at 65°C for 30 
5 minutes. 20 ul of formamide dye was added to the reaction, which was 
heated to 95° C for 5 minutes. 

The reaction was loaded onto a 6% denaturing gel and 
electrophoresed at 60W for 90 minutes. Following electrophoresis the 
gel was dried and exposed to a phosphor screen for 120 minutes and 

10 analysed using Storm 860 (Storm is a Trade Mark) (Molecular 

Dynamics). Analysis of the run showed that two cleavage products were 
present in approximately equal amounts. One product was 23 nucleotides 
(n) in length, corresponding to the G allele, while the second product 
corresponding to the A allele, was 26n in size, as described below. Since 

15 the 23 n and 26n product were detected in roughly equal amounts, this 
indicates that the target nucleic acid was from an individual fish 
heterozygous for the G380A polymorphism. 

The 23n product could only have been generated if an inosine 
residue was incorporated in the upper strand at position 380. Thus the 

20 detection of a 23n fragment is diagnostic of the presence of a G residue 
at position 380 on the upper strand. In the A allele, the first inosine 
incorporated on the upper strand should be the nucleotide at position 383 
on the upper strand and should result in a 26n product since 
Endonuclease V enzyme cuts the one nucleotide 3' (i.e. downstream of 

25 the inosine residue. Thus, the detection of a 26n fragment is diagnostic 
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of the absence of a G residue at position 380 on the upper strand, thereby 
indicating the presence of the other allele, namely A allele at this 
position. When a fish which was homozygous for the G allele was 
analysed, then only a 23n product was detected. When a fish 
5 homozygous for the A allele was analysed, then only the 26n product 
was detected. 

Example 3 

The method according to the invention was used to visualise the 
presence of the G (and therefore C also) content of a segment of the 
10 synaptopysin gene from Atlantic cod {Gadits morhua). This example 
describes where the upper strand is selected as the target strand and the 
position of G nucleotides was detennined. 

The upper primer was labelled with a y 32 PATP. The genomic 
DNA sample was amplified by PCR as follows: the reaction mix for 

15 PCR contained lOOng genomic DNA from the cod, 0.2mM dATP, 
dCTP, and dTTP, 0.05mM dGTP and 0.1 5mM dITP, 1.5mM MgCl 2 , 
Bioline Taq buffer, and 1 unit of Bioline Taq Polymerase, 6pmol of each 
primer in a total volume of 25u.l. The reaction was then placed in a 
thermocylcer with a heated hood. 35 Cycles of 95° C for 60 seconds, 

20 45°C for 60 seconds and 65°C for 60 seconds were carried out in a 

thermocycler followed by 65°C for 600 seconds. The reaction mixture 
bearing the amplified DNA was then treated with Exonuclease I to digest 
any unincorporated primers remaining in the reaction mixture as follows: 
5 pi of the PCR reaction with 2 units of Exonuclease I, 5mM MgCl 2 , 

25 lOmM Tris/HCl pH 7.5 in a 19pl reaction volume at 37° C for 30 
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minutes. The Exonuclease I was subsequently heat inactivated by 
incubating the reaction at 80°C for 15 minutes. 

The DNA mix (19|nl) was then digested with 1 ul of Endonuclease 
V (0.4 ng/ml preparation). The reaction was incubated at 65°C for 30 
5 minutes. 20p.l of formamide dye was added to the reaction, which was 
heated to 95°C for 5 minutes. 

The reaction was loaded onto a 6% denaturing gel and 
electrophoresed at 60W for 90 minutes. Following electrophoresis the 
gel was dried and exposed to phosphor screen for 120 minutes and 
10 analysed using Storm 860(Molecular Dynamics). Analysis of the run 
showed that cleavage products, i.e. bands visible on a gel, were present 
for each G residue in the sequence. 

The ratio of I:G ensures that in every amplification, a proportion 
of the fragments will have an I residue at at least one G position in the 

1 5 fragment, so that the resulting test sample will provide a heterogenous 
mix of DNA fragments with I residues at all possible G positions. The 
resulting amplified strands are cleaved by Endo V and the resulting 
fragments are analysed by denaturing PAGE. Analysis of the resulting 
banding pattern allows for identification and localisation of a sequence 

20 change. For example, analysis of a normal DNA fragment containing 10 
G residues will yield a PAGE ladder of 10 labelled fragments. Analysis 
of a similar fragment with a G-A mutation will result in the generation of 
a PAGE ladder of 9 fragments with the absence of a labelled fragment 
corresponding to that G residue that was mutated. Inosine in this instance 

25 is not incoiporated at the mutation site and thus cleavage does not occur. 
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Therefore a DNA fragment of this particular size is not generated. 
Alternatively if the sequence contains a mutation which results in an 
extra G residue, an extra labelled fragment will be observed in the PAGE 
ladder. The length of this additional fragment allows localisation of the 
5 sequence change. 

Example 4 

The method according to the invention was used to detect the 
presence of a G to T SNP (rs 1894702) at position 101663 of the human 
Factor V (Accession number NT_004668 ) in a patient heterozygous for 

10 that SNP. The sequence of steps is as generally depicted in the 

accompanying figure. The upper strand in this instance was selected as 
the target strand and the presence or absence of a G nucleotide at the 
SNP site was determined. The G allele refers to that Factor V allele 
bearing the G nucleotide at position 101663 (on the upper strand), and 

15 the T allele refers to the Factor V allele bearing the T nucleotide at 
position 101663 (on the upper strand). 

The upper primer was synthesised with a 5' fluorescent label 
(FAM). The genomic DNA sample was amplified by PCR as follows: 
the reaction mix for PCR contained lOng genomic DNA from the 

20 patient, 0.2mM dATP, dCTP, dGTP and dTTP, 1 .5mM MgCl 2> 10mM 
Tris-HCl, pH8.3, 50mM KCL, 0.3 units JumpStart Taq Polymerase, 
6pmoies of each primer in a total volume of 5ul. The reaction was then 
placed in a thermocylcer with a heated hood. 35 Cycles of 95°C for 30 
seconds, 60° C for 15 seconds and 72°C for 15 seconds were carried out 

25 in a thermocycler followed by 72°C. for 300 seconds. The reaction 



WO 2004/003232 PCT/1E2003/000098 

28 

mixture bearing the amplified DNA was then treated with Shrimp 
Alkaline Phosphatase (SAP) to digest any unincorporated nucleotides 
remaining in the reaction mixture as follows: 1 |Lil of the PCR reaction, 
and 1 unit SAP in a 5ul reaction volume at 37°C for 30 minutes. The 
5 SAP was subsequently heat inactivated by incubating the reaction at 
80°C for 15 minutes. The Exo I treated PCR was then diluted 500 fold, 
and lul re-amplified as follows: 0.2mM dATP, dCTP, dITP and dTTP, 
1.5mM MgCl 2 , lOmM Tris-HCl, pH8.3, 50mMKCl, 0.3 units JumpStart 
Taq Polymerase, 6pmoles of each primer in a total volume of 10pl. The 
1 0 reaction was then placed in a thermocylcer with a heated hood, incubated 
at 95° C for 180 seconds, followed by 45 cycles of 94°C for 120 
seconds, 45°C for 60 seconds, 65°C for 120 seconds, were carried out in 
a thermocycler followed by 65°C for 600 seconds. 

The reaction mixture bearing the amplified DNA was then treated 
1 5 with Exonuclease I to digest any unincorporated primers remaining in 
the reaction mixture as follows: 1 .5ul of the PCR reaction, and 2 units of 
Exonuclease I in a 5ul reaction volume at 37°C for 30 minutes. 

The reaction volume was increased to 20 ul with the addition of 
H 2 0 and was purified by centrifugation through a CentriSep (CentriSep 

20 is a Trade Mark) DNA spin column (Princeton Separations). The 

purified DNA (20ul) was then digested with 1.5 u.1 of Endonuclease V in 
the presence of 2mM MgCl 2 . The experiment was repeated with 
Endonuclease V from Archaeoglobus fulgidus and Thermatoga maritima 
with similar results) (0.4 |.ig/ml preparation). The reaction was incubated 

25 at 65°C for 30 minutes for Thermatoga maritima and 85°C for 30 
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minutes for Archaeo globus fulgidus. 1.5(0.1 of the reaction was added to 
lOul of forniamide and 0.2ul of ROX labeled size standard, which was 
heated to 95°C for 5 minutes. 

The reaction was loaded onto an ABI 3 100 Genetic Analyzer 
5 containing POP-4 (POP is a Trade Mark) matrix and electrophoresed at 
15kV for 30 minutes. Analysis of the run showed that two cleavage 
products were present in approximately equal amounts. One product was 
21 nucleotides (n) in length, corresponding to the G allele, while the 
second product corresponding to the T allele, was 23n in size, as 
10 described below. 

The 2 In product could only have been generated if an inosine 
residue was incorporated in the upper strand at position 101663. Thus 
the detection of a 2 In fragment is diagnostic of the presence of a G 
residue at position 101663 on the upper strand by incorporation of an 

15 inosine at this site and the Endonuclease V enzyme cutting the upper 

strand one nucleotide 3', i.e. downstream, of the inosine residue. In the T 
allele, the first inosine incorporated on the upper strand should be at 
position 101665 on the upper strand and should result in a 23n product 
since Endonuclease V enzyme cuts the upper strand one nucleotide 3' of 

20 the inosine residue. Thus the detection of a 23n fragment is diagnostic of 
the absence of a G residue at position 101663 on the upper strand, 
thereby indicating the presence of the other allele, namely T allele at this 
position. The 2 In and 23n product were detected in equal amounts 
indicating that the target nucleic acid was from an individual 

25 heterozygous for the rs 1 894702 polymorphism. When a patient 
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homozygous for the G allele was analysed, then only a 2 In product was 
detected. When a patient homozygous for the T allele was analysed, then 
only the 23 n product was detected. The relative intensity of the 21n 
product to the 23n product allows one to determine the relative levels of 
5 each allele in a target nucleic acid sample. This is especially useful for 
analyzing pooled DNA samples where there may be a large difference 
between the levels of each allele. 

Example 5 

10 The method according to the invention was used to detect the 

presence of a G to C SNP (rs 10427 14) at position 4251 of the human 
Beta-2 adrenergic receptor (BAR2) (Accession number NT 031801) in a 
patient heterozygous for that SNP. The sequence of steps is as generally 
depicted in the accompanying figure. The upper strand in this instance is 

1 5 selected as the target strand and the presence or absence of a G 

nucleotide at the SNP site was determined. The G allele refers to that 
BAR2 allele bearing the G nucleotide at position 4251 (on the upper 
strand), and the C allele refers to the BAR2 allele bearing the C 
nucleotide at position 4251 (on the upper strand). 

20 The upper primer (20 nucleotides) was synthesised with a 5' 

fluorescent label (FAM). The genomic DNA sample was amplified by 
PCR as follows: the reaction mix for PCR contained lOng genomic DNA 
from the patient, 0.2mM dATP, dCTP, dGTP and dTTP, 1.5mM MgCl 2 , 
lOmM Tris-HCl, pH8.3, 50mM KC1, 0.3 units JumpStart Taq 

25 Polymerase, 6pmoles of each primer in a total volume of 5 pi. The 
reaction was then placed in a thermocylcer with a heated hood. 35 
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Cycles of 95°C for 30 seconds, 60°C for 15 seconds and 72°C for 15 
seconds were carried out in a thermocycler followed by 72°C. for 300 
seconds. The reaction mixture bearing the amplified DNA was then 
treated with Shrimp Alkaline Phosphatase (SAP) to digest any 

5 unincorporated nucleotides remaining in the reaction mixture as follows: 
lul of the PCR reaction, and 1 unit SAP in a 5ul reaction volume at 37°C 
for 30 minutes. The SAP was subsequently heat inactivated by 
incubating the reaction at 80°C for 15 minutes. The PCR reactions were 
then diluted 500 fold, and re-amplified as follows: 0.2mM dATP, dCTP, 

10 dITP and dTTP, 1 .5mM MgCl 2 , lOmM Tris-HCl, pH8.3, 50mM KC1, 
0.3 units JumpStart Taq Polymerase, 6pmoles of each primer in a total 
volume of lOul. The reaction was then placed in a thermocylcer with a 
heated hood. 45 Cycles of 95°C for 180 seconds, 94°C for 120 seconds, 
45°C for 60 seconds, 65°C for 120 seconds, were carried out in a 

15 thermocycler followed by 65°C for 600 seconds. 

The reaction mixture bearing the amplified DNA was then treated 
with Exonuclease I to digest any unincorporated primers remaining in 
the reaction mixture as follows: 1 .5ul of the PCR reaction, and 2 units of 
Exonuclease I in a 5ul reaction volume at 37°C for 30 minutes. 

20 The reaction volume was increased to 20ul with the addition of 

H 2 0 and was purified by centrifugation through a CentriSep DNA spin 
column (Princeton Separations). The purified DNA (20ul) was then 
digested with 1 .5 ul of Endonuclease V in the presence of 2mM MgCl 2 . 
The experiment was repeated with Endonuclease V from Archaeoglobus 

25 fulgidus and Thermatoga maritima with similar results) (0.4 ug/ml 
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preparation). The reaction was incubated at 65°C for 30 minutes for 
Thermatoga maritima and 85°C for 30 minutes for Archaeoglobus 
fulgidus. 1.5 pi of the reaction was added to 10(il of formamide and 0.2p,l 
of ROX labeled size standard, which was heated to 95°C for 5 minutes. 

5 The reaction was loaded onto an ABI 3 100 Genetic Analyzer 

containing POP-4 matrix and electrophoresed at 15kV for 30 minutes. 
Analysis of the run showed that two cleavage products were present in 
approximately equal amounts. One product was 22 nucleotides (n) in 
length, corresponding to the G allele, while the second product 
10 corresponding to the C allele, was 26n in size, as described below. 



The 22n product could only have been generated if an inosine 
residue was incoiporated in the upper strand at position 425 1 . Thus the 
detection of a 22n fragment is diagnostic of the presence of a G residue 

1 5 at position 425 1 on the upper strand, by incorporation of an inosine 
residue and the Endonuclease V enzyme cutting the upper strand one 
nucleotide 3 1 i.e. downstream of the inosine residue. In the C allele, the 
first inosine incoiporated on the upper strand is at position 4255 on the 
upper strand and should result in a 26n product since Endonuclease V 

20 enzyme cuts the upper strand one nucleotide 3 ! of the inosine residue. 
Thus the detection of a 26n fragment is diagnostic of the absence of a G 
residue at position 4251 on the upper strand, thereby indicating the 
presence of the other allele, namely C allele at this position. The 22n and 
26n product were detected in equal amounts indicating that the target 

25 nucleic acid was from an individual heterozygous for the rs 10427 14 
polymorphism. 



